CDS

Accession Number TCMCG005C30232
gbkey CDS
Protein Id XP_020243981.1
Location complement(join(115485704..115485874,115486468..115486522,115486602..115486648,115486745..115486801,115486895..115487003,115495543..115495624,115497317..115497401,115497475..115497594,115497675..115497725,115498031..115498096,115499410..115499457,115499545..115499593,115499687..115499740,115499857..115499916,115500022..115500080,115500529..115500645,115500741..115500752))
Gene LOC109822226
GeneID 109822226
Organism Asparagus officinalis
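
The Location field above lists 17 exon intervals on the complement strand. As a consistency check, the summed exon lengths should equal three times the protein length plus one stop codon. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive as in the GenBank feature-location syntax; the names `LOCATION` and `exon_intervals` are illustrative and not part of the record.

```python
import re

# Location string copied from the Location field of this record.
LOCATION = (
    "complement(join(115485704..115485874,115486468..115486522,"
    "115486602..115486648,115486745..115486801,115486895..115487003,"
    "115495543..115495624,115497317..115497401,115497475..115497594,"
    "115497675..115497725,115498031..115498096,115499410..115499457,"
    "115499545..115499593,115499687..115499740,115499857..115499916,"
    "115500022..115500080,115500529..115500645,115500741..115500752))"
)


def exon_intervals(location):
    """Return (start, end) pairs in the order they are written.

    Assumes every interval has the simple form start..end, which holds
    for this record but not for every possible location string.
    """
    return [(int(a), int(b)) for a, b in re.findall(r"(\d+)\.\.(\d+)", location)]


exons = exon_intervals(LOCATION)

# Coordinates are inclusive, so each exon spans end - start + 1 bases.
cds_length = sum(end - start + 1 for start, end in exons)

# The final codon is the stop codon and is not counted in the protein.
protein_length = cds_length // 3 - 1

print(len(exons), cds_length, protein_length)  # prints: 17 1242 413
```

The result of 413 residues agrees with the Length field in the Protein section.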

Protein

Length 413aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA376608;
db_source XM_020388392.1
Definition flap endonuclease 1-A isoform X1

EGGNOG-MAPPER Annotation

COG_category L
Description Structure-specific nuclease with 5'-flap endonuclease and 5'-3' exonuclease activities involved in DNA replication and repair. During DNA replication, cleaves the 5'-overhanging flap structure that is generated by displacement synthesis when DNA polymerase encounters the 5'-end of a downstream Okazaki fragment. It enters the flap from the 5'-end and then tracks to cleave the flap base, leaving a nick for ligation. Also involved in the long patch base excision repair (LP-BER) pathway, by cleaving within the apurinic/apyrimidinic (AP) site-terminated flap. Acts as a genome stabilization factor that prevents flaps from equilibrating into structures that lead to duplications and deletions. Also possesses 5'-3' exonuclease activity on nicked or gapped double-stranded DNA, and exhibits RNase H activity. Also involved in replication and repair of rDNA and in repairing mitochondrial DNA.
KEGG_TC -
KEGG_Module -
KEGG_Reaction -
KEGG_rclass -
BRITE ko00000
ko00001
ko01000
ko03032
ko03400
ko04147
KEGG_ko ko:K04799
EC -
KEGG_Pathway ko03030
ko03410
ko03450
map03030
map03410
map03450
GOs GO:0005575
GO:0005622
GO:0005623
GO:0005634
GO:0005730
GO:0031974
GO:0031981
GO:0043226
GO:0043227
GO:0043228
GO:0043229
GO:0043231
GO:0043232
GO:0043233
GO:0044422
GO:0044424
GO:0044428
GO:0044446
GO:0044464
GO:0070013

Sequence

CDS:  
ATGGGCATTAAGGGTTTAACGAAGCTGCTGGCTGACAACGCCCCAAAGGCCATGAAGGAGCAAAAATTCGAGAGCTATTTCGGTCGCAAGATTGCCATCGACGCTAGCATGAGCATCTACCAGTTCCTTATTGTGGTGGGAAGAACTGGAACTGAAACTCTTACCAATGAGGCTGGCGAGGTTACCAGTCATTTGCAAGGAATGTTCACACGCACAATAAGATTACTAGAGGCTGGAATGAAGCCAGCATTTGTCTTTGATGGTCAGCCACCTGACCTGAAGAAACAAGAGCTTGCAAAGAGGTACACAAGGAGAGAGGACGCGACCAAAGACCTAAATGCAGCAATTGAGACCGGAGATAAGGTGGAAATTGAGAAATTCAGCAAAAGAACTGTAAAGGTGACCAAGCAACATAATGAGGATTGCAAAAAACTTCTGAGACTGATGGGTGTCCCTGTGATTGAGGCGCCAAGCGAAGCAGAAGCAGAATGTGCATCACTTTGCAAAAATGGCAAGGTGTATGCTGTTGCTTCAGAAGATATGGATTCATTAACCTTTGGAGCCCCAAGGTTTCTTCGCCATCTAATGGATCCAAGCTCCAAAAAGATTCCTGTAATGGAATTTGAAGTTGCTAAGGTTCTGGCAGAGCTAGAACTGACCATGGATCAATTCATTGACTTGTGTATTCTTTGTGGGTGTGACTACTGTGATAGTATCAAAGGTATTGGGGGGCAAACAGCCTTAAAATTGATCCGTCAACATGGTACAATAGAGACCATATTGGAGAATATAAACAGGGATAGGTATCAGATACCTGAGGATTGGCCATACCAAGAAGCTCGACGCCTGTTTAAAGAGCCTCTTGTCACTAATGACTCAGAAGAGCTTAAGTGGACTCCTCCAGAGGAGGAGGGTCTCGTGAACTTTCTGGTGAATGAAAACGGTTTCAACAACGATCGAGTAGTAAAGTCGATAGAGAAAATTAAAGCAGCAAAGAATAAGTCTTCCCAGGGCCGATTGGAATCTTTTTTCAAGCCAGTTGTGGGTACATCTGTACCTGTTAAGCGCAAGGGAACCCGATGTGCGCTTGGTGATGCCAAATTGGCACTCAAGCCTAGGATGCTTACTCTGCATGTCACTATGTCCTCAGACGCCAAAGTTGGCAGAGTAAAAATGCCAAATCCACAACTTGGTTTCAGATGCCGCAAAACATCTATAGCGGGACTTTGTTTCAGCCGGTGA
Protein:  
MGIKGLTKLLADNAPKAMKEQKFESYFGRKIAIDASMSIYQFLIVVGRTGTETLTNEAGEVTSHLQGMFTRTIRLLEAGMKPAFVFDGQPPDLKKQELAKRYTRREDATKDLNAAIETGDKVEIEKFSKRTVKVTKQHNEDCKKLLRLMGVPVIEAPSEAEAECASLCKNGKVYAVASEDMDSLTFGAPRFLRHLMDPSSKKIPVMEFEVAKVLAELELTMDQFIDLCILCGCDYCDSIKGIGGQTALKLIRQHGTIETILENINRDRYQIPEDWPYQEARRLFKEPLVTNDSEELKWTPPEEEGLVNFLVNENGFNNDRVVKSIEKIKAAKNKSSQGRLESFFKPVVGTSVPVKRKGTRCALGDAKLALKPRMLTLHVTMSSDAKVGRVKMPNPQLGFRCRKTSIAGLCFSR
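
The CDS and Protein sequences above can be cross-checked by translating the nucleotide sequence. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) applies; the record does not state a translation table. Only the first nine and last five codons of the CDS are used here as excerpts, and the names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Standard genetic code (assumed), written in the conventional TCAG order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Excerpts copied from the start and end of the CDS in this record.
cds_start = "ATGGGCATTAAGGGTTTAACGAAGCTG"
cds_end = "TGTTTCAGCCGGTGA"

print(translate(cds_start))  # prints: MGIKGLTKL
print(translate(cds_end))    # prints: CFSR
```

Both results match the first nine and last four residues of the Protein sequence. Passing the full CDS string to `translate` would allow the same comparison over the whole sequence.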